Indicators Ratings Parallel =  11 ms
Indicators Ratings =  10 ms
CPU compute ratings: 1 iteration avg time: 2144 ms
 
 ---- 
cuda_malloc<d_ratings> 0.89
cuda_malloc<d_companies_map> 0.7
kernel<computeRatingsKernel> 1.18
kernelSegment<sort> 4.74
toHostMemory<d_ratings> 1.13
toHostMemory<d_companies_map> 1.16
ranking 2.11
TotalRankingTime 12.33

cuda_malloc<d_ratings> 0.23
cuda_malloc<d_companies_map> 0.08
kernel<computeRatingsKernel> 1.17
kernelSegment<sort> 1.14
toHostMemory<d_ratings> 1.11
toHostMemory<d_companies_map> 1.12
ranking 2.13
TotalRankingTime 7.28

cuda_malloc<d_ratings> 0.26
cuda_malloc<d_companies_map> 0.08
kernel<computeRatingsKernel> 1.16
kernelSegment<sort> 1.2
toHostMemory<d_ratings> 2.2
toHostMemory<d_companies_map> 2.19
ranking 2.94
TotalRankingTime 10.29

GRID K520 :  797.000 Mhz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3908MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 Mhz x 256 bits   (160.0 GB/s)
ECC Disabled


